On Rolling Back and Checkpointing in Time Warp
نویسندگان
چکیده
ÐIn this paper, we present a family of three algorithms which serve to perform checkpoints and to roll back Time Warp. These algorithms are primarily intended for use in simulations in which there are a large number of LPs and in which events have a small computational granularity. Important representatives of this class are VLSI and computer network simulations. In each of our algorithms, LPs are gathered into clusters via algorithms which are application dependent. In order to examine the performance of our algorithms and to compare them to Time Warp, we made use of two of the largest digital logic circuits available from the ISCAS89 benchmark series of combinational circuits. The execution time, number of states saved, and maximal memory consumption were compared to the same quantities for Time Warp. Our results indicated that each of the algorithms occupies a different point in the spectrum of possible trade-offs between memory usage and execution time, ranging from substantial memory savings (at a comparable cost in speed) to memory savings and a comparable speed to Time Warp. Hence, an important benefit of our algorithms is the ability to trade off memory requirements with execution time. Index TermsÐParallel simulation, distributed simulation, distributed processing.
منابع مشابه
Dynamic Checkpoint Interval Selection in Time Warp Simulation
In Time Warp parallel simulation, a process executes every message as soon as it arrives. If a message with a smaller timestamp subsequently arrives, the process rolls back its state to the time of the earlier message and re-executes from that point. Clearly, the state of each process must be saved (checkpointed) regularly in case a rollback is necessary. Although most existing Time Warp implem...
متن کاملSelecting the Checkpoint Interval in Time Warp Parallel Simulation∗
In Time Warp parallel simulation, a process executes every message as soon as it arrives. If a message with a smaller timestamp subsequently arrives, the process rolls back its state to the time of the earlier message and re-executes from that point. Clearly, the state of each process must be saved (checkpointed) regularly in case a rollback is necessary. Although most existing Time Warp implem...
متن کاملBenefits from Semi-asynchronous Checkpointing for Time Warp Simulations of a Large State Pcs Model
Checkpointing overhead is a major obstacle for the effectiveness of Time Warp parallel discrete event simulators. Semi-asynchronous checkpointing is a recent solution to tackle this obstacle for Time Warp simulations on distributed memory systems based on Myrinet. In this solution, checkpoint operations are offloaded from the host CPU and are charged to a DMA engine on board of Myrinet network ...
متن کاملProbabilistic Checkpointing in Time Warp Parallel Simulation
In the Time Warp (TW) protocol, the system state must be checkpointed to facilitate the rollback operation. While increasing the checkpointing frequency increases the state saving cost, an infrequent scheme also escalates the coast forward eeort when a large number of executed events are redone. This paper proposes a prob-abilistic approach to checkpointing. We derive the roll-back probability,...
متن کاملA comparative study of state saving mechanisms for time warp synchronized parallel discrete event simulation
The state saving mechanism constitutes an essential part of any system in which erroneous or undesired executions can be undone by rolling back the system state. One such application is optimistically synchronized parallel discrete event simulation (PDES) systems based on the Time Warp Synchronization mechanism. In this type of systems the state saving and restoration mechanism is essential to ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- IEEE Trans. Parallel Distrib. Syst.
دوره 12 شماره
صفحات -
تاریخ انتشار 2001